Efficient Personalized PageRank Estimation for Many Sources and Many Targets

نویسندگان

  • Daniel Vial
  • Vijay Subramanian
چکیده

Personalized PageRank (PPR) is a measure of the importance of a node in a graph from the perspective of another node (we call these nodes the target and the source, respectively). PPR has been used in many applications, such as offering a Twitter user (the source) personalized recommendations of who to follow (targets deemed important by PPR). Computing PPR at scale is infeasible for networks like Twitter, so many estimation algorithms have been proposed. Here we explore two methods for estimating PPR of many source/target pairs. Our first method is based on the state-of-the-art PPR estimator for a single source/target pair. We extend this method to the cases of many sources and/or many targets by eliminating repeated computations that may occur when using existing algorithms separately for each source or each target. Our second method considers computation of a source's PPR when the PPR for a set of nodes, called the known set, are given. If the known set is chosen appropriately, we can compute the source's PPR on a smaller graph. At a high level, both approaches can be viewed as dimensionality reduction methods in which quantities used to compute PPR are shared among sources and/or targets by exploiting certain graph characteristics. Moving forward, we aim to develop a deeper understanding of how PPR vectors -- a high-dimensional set of objects -- are related. This may in turn help us understand how information is organized in modern networks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Algorithm for Top-k Personalized PageRank Queries with Layered Graphs

In recent years, an efficient method of performing analyses and computations on graph networks, regarding recent and up-to-date data, has been needed due to continuous growth of datasets. Personalized PageRank is one of the most well-known computation methods for graphs. Personalized PageRank computes the relative importance or relevance with respect to a set of given nodes, called start nodes ...

متن کامل

Personalized PageRank Solution Paths

Personalized PageRank vectors used for many community detection and graph diffusion problems have a subtle dependence on a parameter epsilon that controls their accuracy. This parameter governs the sparsity of the solution and can be interpreted as a regularization parameter. We study algorithms to estimate the solution path as a function of the sparsity and propose two methods for this task. T...

متن کامل

Efficient Algorithms for Personalized Pagerank a Dissertation Submitted to the Department of Computer Science and the Committee on Graduate Studies of Stanford University in Partial Fulfillment of the Requirements for the Degree of Doctor of Philosophy

We present new, more efficient algorithms for estimating random walk scores such as Personalized PageRank from a given source node to one or several target nodes. These scores are useful for personalized search and recommendations on networks including social networks, user-item networks, and the web. Past work has proposed using Monte Carlo or using linear algebra to estimate scores from a sin...

متن کامل

Efficient Algorithms for Personalized PageRank

We present new, more efficient algorithms for estimating random walk scores such as Personalized PageRank from a given source node to one or several target nodes. These scores are useful for personalized search and recommendations on networks including social networks, user-item networks, and the web. Past work has proposed using Monte Carlo or using linear algebra to estimate scores from a sin...

متن کامل

Incremental and Accuracy-Aware Personalized PageRank through Scheduled Approximation

As Personalized PageRank has been widely leveraged for ranking on a graph, the efficient computation of Personalized PageRank Vector (PPV) becomes a prominent issue. In this paper, we propose FastPPV, an approximate PPV computation algorithm that is incremental and accuracy-aware. Our approach hinges on a novel paradigm of scheduled approximation: the computation is partitioned and scheduled fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1706.01091  شماره 

صفحات  -

تاریخ انتشار 2017